# Lightweight TTS

Argos 4b 0.2 Es
MIT
A text-to-speech model fine-tuned from Orpheus-3B that converts text into natural, fluent speech
Speech Synthesis Safetensors Spanish
danyw24
327
1
Dia GGUF
MIT
Dia 1.6B is a text-to-speech model available in multiple quantized versions and compatible with the TTS.cpp framework.
Speech Synthesis
mmwillet2
156
4
Spark TTS 0.5B 4 6bit
Spark-TTS-0.5B-4-6bit is a text-to-speech model in MLX format that supports both English and Chinese.
Speech Synthesis Supports Multiple Languages
mlx-community
59
0
Spark TTS 0.5B Bf16
Spark-TTS-0.5B-bf16 is a text-to-speech model in MLX format that supports both English and Chinese.
Speech Synthesis Supports Multiple Languages
mlx-community
121
0
Oute TTS 500M
Apache-2.0
OuteTTS is a 500M-parameter text-to-speech (TTS) model focused on Turkish, capable of converting Turkish text into natural speech.
Speech Synthesis Other
Karayakar
27
0
Canary Tts 0.5b
A Japanese TTS model trained on sarashina2.2-0.5b-instruct-v0.1, supporting quality control via prompts
Speech Synthesis PyTorch Supports Multiple Languages
2121-8
244
6
Styletts2 Lite
MIT
A lightweight version of StyleTTS 2, focused on text-to-speech tasks, with multiple components removed to reduce complexity.
Speech Synthesis English
dangtr0408
22
0
Orpheus 3b Kaya Q4 K M.gguf
Apache-2.0
A fine-tuned text-to-speech model based on Canopy Labs' pre-trained model, quantized for efficient inference
Speech Synthesis Supports Multiple Languages
lex-au
98
0
Kokoro GGUF
MIT
Kokoro is a text-to-speech (TTS) model offering GGUF-encoded versions with dual phonemization support.
Speech Synthesis
mmwillet2
749
1
3b De Ft Research Release 4bit
Apache-2.0
A German text-to-speech model converted to the MLX format.
Speech Synthesis Transformers German
mlx-community
19
0
Orpheus Bangla Tts Gguf 8bit
Apache-2.0
A proof-of-concept fine-tune of the Orpheus 3B text-to-speech (TTS) model that adds Bengali support.
Speech Synthesis Other
asif00
44
0
Orpheus Bangla Tts Gguf
Apache-2.0
A fine-tuned version of the Orpheus 3B TTS model for Bengali, trained on 955 audio samples and suitable for experimental Bengali speech synthesis
Speech Synthesis Other
asif00
55
0
Cisimi V0.1
CiSiMi is an early prototype of a text-to-audio model designed for resource-constrained environments, able to run speech synthesis efficiently on a CPU.
Speech Synthesis English
KandirResearch
202
7
Kokorotts
Apache-2.0
Kokoro is an open-source text-to-speech model with 82 million parameters, delivering sound quality comparable to large models through a lightweight architecture while significantly improving speed and cost efficiency.
Speech Synthesis English
Daemontatox
78
0
Kokoro 82M V1.1 Zh
Apache-2.0
Kokoro is an open-weight series of small yet powerful text-to-speech (TTS) models, now featuring data from 100 Chinese speakers sourced from professional datasets.
Speech Synthesis
hexgrad
51.56k
112
Kokoro 82M
Apache-2.0
Kokoro is an open-source TTS model with 82 million parameters, delivering audio quality comparable to larger models while offering significant speed advantages and cost efficiency.
Speech Synthesis English
prince-canuma
376
2
Kokoro V1 0
Apache-2.0
Kokoro is an open-source text-to-speech model with 82 million parameters, achieving sound quality comparable to large models with a lightweight architecture while improving generation speed and reducing computational costs.
Speech Synthesis English
kiriyamaX
18
1
Kokoro 82M Light
Apache-2.0
A clone of StyleTTS2-LJSpeech, optimized for English text-to-speech tasks, with reduced dependencies for simpler deployment.
Speech Synthesis English
ctranslate2-4you
21
8
Kokoro
Apache-2.0
Kokoro is a cutting-edge text-to-speech (TTS) model with 82 million parameters, released under the Apache 2.0 license. It ranked #1 in the TTS Spaces Arena, achieving higher Elo scores with fewer parameters and less data.
Speech Synthesis English
geneing
37
16
Kokoro 82M
Apache-2.0
Kokoro is an open-source text-to-speech (TTS) model with 82 million parameters, renowned for its lightweight architecture and high audio quality, while also being fast and cost-effective.
Speech Synthesis English
hexgrad
2.0M
4,155
Parler Tts Mini V1 GGUF
Apache-2.0
A GGUF-format model file of Parler TTS Mini v1 for English text-to-speech tasks.
Speech Synthesis English
ecyht2
198
4
Indri 0.1 350m Tts
Indri is a novel, ultra-small, lightweight TTS model based on the Transformer architecture, supporting text-to-speech tasks in English and Hindi.
Speech Synthesis Transformers Supports Multiple Languages
11mlabs
1,088
0
Japanese Parler Tts Large Bate
Other
A Japanese text-to-speech model fine-tuned from parler-tts-large-v1, capable of generating high-quality Japanese speech
Speech Synthesis Transformers Japanese
2121-8
114
17
Indri 0.1 124m Tts
Indri is an ultra-compact, lightweight TTS model based on the Transformer architecture, supporting English and Hindi text-to-speech tasks.
Speech Synthesis Transformers Supports Multiple Languages
11mlabs
182
3
Parler Tts Mini V1.1
Apache-2.0
Parler-TTS Mini v1.1 is a lightweight text-to-speech model trained on 45,000 hours of audio data, capable of generating high-quality, natural-sounding speech with controllable features through simple text prompts.
Speech Synthesis Transformers English
parler-tts
1,490
19
Parler Tts Tiny V1
Apache-2.0
A lightweight text-to-speech model trained on 45,000 hours of audio data, with voice attributes controllable through text prompts
Speech Synthesis Transformers English
parler-tts
67
1
Parler Tts Mini V0.1
Apache-2.0
Parler-TTS Mini is a lightweight text-to-speech model trained on 10.5K hours of audio data, supporting voice feature control through text prompts.
Speech Synthesis Transformers English
parler-tts
5,430
352
Speech T5 Ur
MIT
An Urdu speech synthesis model based on microsoft/speecht5_tts, fine-tuned on the FLEURS dataset
Speech Synthesis Transformers Other
Pak-Speech-Processing
38
0
© 2025 AIbase